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1 Introduction 

For most of its history, mathematics was algorithmic in nature. The geometric 
claims in Euclid's Elements fall into two distinct categories: "problems," which 
assert that a construction can be carried out to meet a given specification, and 
"theorems," which assert that some property holds of a particular geometric 
configuration. For example, Proposition 10 of Book I reads "To bisect a given 
straight line." Euclid's "proof" gives the construction, and ends with the (Greek 
equivalent of) Q.E.F., for quod erat faciendum, or "that which was to be done." 
Proofs of theorems, in contrast, end with Q.E.D., for quod erat demonstran- 
dum, or "that which was to be shown"; but even these typically involve the 
construction of auxiliary geometric objects in order to verify the claim. 

Similarly, algebra was devoted to developing algorithms for solving equa- 
tions. This outlook characterized the subject from its origins in ancient Egypt 
and Babylon, through the ninth century work of al-Khwarizmi, to the solutions 
to the quadratic and cubic equations in Cardano's Ars Magna of 1545, and to 
Lagrange's study of the quintic in his Reflexions sur la resolution algebrique des 
equations of 1770. 

The theory of probability, which was born in an exchange of letters between 
Blaise Pascal and Pierre de Fermat in 1654 and developed further by Christian 
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Huygens and Jakob Bernoulli, provided methods for calculating odds related to 
games of chance. Abraham de Moivre's 1718 monograph on the subject was 
entitled The Doctrine of Chances: or, a Method for Calculating the Probabil- 
ities of Events in Play. Pierre de Laplace's monumental Theorie analytique 
des probabilites expanded the scope of the subject dramatically, addressing sta- 
tistical problems related to everything from astronomical measurement to the 
measurement in the social sciences and the reliability of testimony. Even so, the 
emphasis remained fixed on explicit calculation. 

Analysis had an algorithmic flavor as well. In the early seventeenth century, 
Cavalieri, Fermat, Pascal, and Wallis developed methods of computing "quadra- 
tures," or areas of regions bounded by curves, as well as volumes. In the hands of 
Newton, the calculus became a method of explaining and predicting the motion 
of heavenly and sublunary objects. Euler's Introductio in Analysis Infinitorum 
of 1748 was the first work to base the calculus explicitly on the notion of a 
function; but, for Euler, functions were given by piecewise analytic expressions, 
and once again, his focus was on methods of calculation. 

All this is not to say that all the functions and operations considered by 
mathematicians were computable in the modern sense. Some of Euclid's con- 
structions involve a case split on whether two points are equal or not, and, 
similarly, Euler's piecewise analytic functions were not always continuous. In 
contrast, we will see below that functions on the reals that are computable in 
the modern sense are necessarily continuous. And even though Euler's work is 
sensitive to the rates of convergence of analytic expressions, these rates were 
not made explicit. But these are quibbles, and, generally speaking, mathemati- 
cal arguments through the eighteenth century provided informal algorithms for 
finding objects asserted to exist. 

The situation changed dramatically in the nineteenth century. Galois' the- 
ory of equations implicitly assumed that all the roots of a polynomial exist 
somewhere, but Gauss' 1799 proof of the fundamental theorem of algebra, for 
example, did not show how to compute them. In 1837, Dirichlet considered the 
example of a "function" from the real numbers to the real numbers which is 
equal to 1 on the rationals and on the irrationals, without pausing to consider 
whether such a function is calculable in any sense. The Bolzano- Weierstrafi 
Theorem, first proved by Bolzano in 1817, asserts that any bounded sequence 
of real numbers has a convergent subsequence; in general, there will be no way 
of computing such a subsequence. Riemann's proof of the open mapping the- 
orem was based on the Dirichlet principle, an existence principle that is not 
computationally valid. Cantor's work on the convergence of Fourier series led 
him to consider transfinite iterations of point-set operations, and, ultimately, to 
develop the abstract notion of set. 

Although the tensions between conceptual and computational points of view 
were most salient in analysis, other branches of mathematics were not immune. 
For example, in 1871, Richard Dedckind defined the modern notion of an ideal 
in a ring of algebraic integers, and defined operations on ideals in a purely 
extensional way. In other words, the operations were defined in such a way 
that they do not presuppose any particular representation of the ideals, and 
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definitions do not indicate how to compute the operations in terms of such 
representations. More dramatically, in 1890, Hilbert proved what is now known 
as the Hilbert Basis Theorem. This asserts that, given any sequence /i, /2j/3, ••• 
of multivariate polynomials over a Noetherian ring, there is some n such that for 
every m > n, f m is in the ideal generated by fi,...,f n . Such an m cannot be 
computed by surveying elements of the sequence, since it is not even a continuous 
function on the space of sequences; even if a sequence x,x,x,... starts out 
looking like a constant sequence, one cannot rule out the possibility that the 
clement 1 will eventually appear. 

Such shifts were controversial, and raised questions as to whether the new, 
abstract, set-theoretic methods were appropriate to mathematics. Set-theoretic 
paradoxes in the early twentieth century raised the additional question as to 
whether they are even consistent. Brouwer's attempt, in the 1910's, to found 
mathematics on an "intuitionistic" conception raised a further challenge to mod- 
ern methods, and in 1921, Hermann Weyl, Hilbert's best student, announced 
that he was joining the Brouwerian revolution. The twentieth century Grund- 
lagenstreit, or "crisis of foundations," was born. 

At that point, two radically different paths were open to the mathematical 
community: 

• Restrict the methods of mathematics so that mathematical theorems have 
direct computational validity. In particular, restrict methods so that sets 
and functions asserted to exist are computable, as well as infinitary math- 
ematical objects and structures more generally; and also ensure that quan- 
tifier dependences are also constructive, so that a "forall-exists" statement 
asserts the existence of a computable transformation. 

• Expand the methods of mathematics to allow idealized and abstract oper- 
ations on infinite objects and structures, without concern as to how these 
objects are represented, and without concern as to whether the operations 
have a direct computational interpretation. 

Mainstream contemporary mathematics has chosen decisively in favor of the 
latter. But computation is important to mathematics, and faced with a noncon- 
structive development, there are options available to those specifically interested 
in computation. For example, one can look for computationally valid versions of 
nonconstructivc mathematical theorems, as one does in computable and compu- 
tational mathematics, constructive mathematics, and numerical analysis. There 
are, in addition, various ways of measuring the extent to which ordinary theo- 
rems fail to be computable, and characterizing the data needed to make them 
so. 

With Turing's analysis of computability, we now have precise ways of saying 
what it means for various types of mathematical objects to be computable, 
stating mathematical theorems in computational terms, and specifying the data 
relative to which operations of interest are computable. Section [2] thus discusses 
computable analysis, whereby mathematical theorems are made computationally 
significant by stating the computational content explicitly. 
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There are still communities of mathematicians, however, who are committed 
to developing mathematics in such a way that every concept and assertion has 
an implicit computational meaning. Turing's analysis of computability is useful 
here, too, in a different way: by representing such styles of mathematics in 
formal axiomatic terms, we can make this implicit computational interpretation 
mathematically explicit. Section [3] thus discusses different styles of constructive 
mathematics, and the computational semantics thereof. 

2 Computable analysis 
2.1 Prom Leibniz to Turing 

An interest in the nature of computation can be found in the seventeenth century 
work of Leibniz (see [47]). For example, his stepped reckoner improved on earlier 
mechanical calculating devices like the one of Pascal. It was the first calculating 
machine that was able to perform all four basic arithmetical operations, and it 
earned Leibniz an external membership of the British Royal Society at the age 
of 24. Leibniz's development of calculus is better known, as is the corresponding 
priority dispute with Newton. Leibniz paid considerable attention to choosing 
notations and symbols carefully in order to facilitate calculation, and his use 
of the integral symbol J and the d symbol for derivatives have survived to the 
present day. Leibniz's work on the binary number system, long before the advent 
of digital computers, is also worth mentioning. A more important contribution 
to the study of computation was his notion of a calculus ratiocinator, that is, a 
calculus of reasoning. Such a calculus, Leibniz held, would allow one to resolve 
disputes in a purely mathematical fashion^ 

The only way to rectify our reasonings is to make them as tangible 
as those of the Mathematicians, so that we can find our error at a 
glance, and when there are disputes among persons, we can simply 
say: Let us calculate, without further ado, to see who is right. 

His attempts to develop such a calculus amount to an early form of symbolic 
logic. 

With this perspective, it is not farfetched to see Leibniz as initiating a series 
of developments that culminate in Turing's work. Norbert Wiener has described 
the relationship in the following way [232 : 

The history of the modern computing machine goes back to Leibniz 
and Pascal. Indeed, the general idea of a computing machine is 
nothing but a mechanization of Leibniz's calculus ratiocinator. It 
is, therefore, not at all remarkable that the theory of the present 
computing machine has come to meet the later developments of the 
algebra of logic anticipated by Leibniz. Turing has even suggested 
that the problem of decision, for any mathematical situation, can 

1 Leibniz, The Art of Discovery, 1685 [2551 . 
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always be reduced to the construction of an appropriate computing 
machine. 

2.2 Prom Borel to Turing 

Perhaps the first serious attempt to express the mathematical concepts of a 
computable real number and a computable function on the real numbers were 
made by Emil Borel around 1912, the year that Alan Turing was born. Borel 
defined computable real numbers as followsll 

We say that a number a is computable if, given a natural number 

n, we can obtain a rational number that differs from a by at most 
1 

n ' 

Of course, before the advent of Turing machines or any other formal notion 
of computability, the meaning of the phrase "we can obtain" remained vague. 
But Borel provided the following additional information in a footnote to that 
phrase: 

I intentionally leave aside the practical length of operations, which 
can be shorter or longer; the essential point is that each operation 
can be executed in finite time with a safe method that is unambigu- 
ous. 

This makes it clear that Borel had an intuitive notion of an algorithm in 
mind. Borel then indicated the importance of number representations, and 
argued that decimal expansions have no special theoretical value, whereas con- 
tinued fraction expansions are not invariant under arithmetic operations and 
hence of no practical value. He went on to discuss the problem of determining 
whether two real numbers are equal: 

The first problem in the theory of computable numbers is the prob- 
lem of equality of two such numbers. If two computable numbers 
are unequal, this can obviously be noticed by computing both with 
sufficient precision, but in general it will not be known a priori. One 
can make clear progress in determining a lower bound on the dif- 
ference of two computable numbers, whose definitions satisfy known 
conditions. 

In modern terms, what Borel seems to recognize here is that although there 
is no algorithm that decides whether two computable real numbers are equal, 
the inequality relation between computable reals is, at least, computably enu- 
merable. He then discussed a notion of the height of a number, which is based 
on counting the number of steps needed to construct that number, in a certain 

2 All citations of Borel are from 1191 . which is a reprint of 1181 . The translations here are 
by the authors of this article; obvious mistakes in the original have been corrected. 
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way. This concept can be seen as an early forerunner of the concept of Kol- 
mogorov complexity. Borel considered ways that this concept might be utilized 
in addressing the equality problem. 

In another section of his paper, Borel discussed the concept of a computable 
real number function, which he defined as follows: 

We say that a function is computable if its value is computable for 
any computable value of the variable. In other words, if a is a 
computable number, one has to know how to compute the value of 
f(a) with precision — for any n. One should not forget that, by 
definition, to be given a computable number a just means to be 
given a method to obtain an arbitrary approximation to a. 

It is worth noting that Borel only demanded computability at computable 
inputs in his definition. His definition is vague in the sense that he did not indi- 
cate whether he had in mind an algorithm that transfers a method to compute 
a into a method to compute f(a) (which would later become known as Markov 
computability or the Russian approach) or whether he had in mind an algorithm 
that transfers an approximation of a into an approximation of f{ct) (which is 
closer to what we now call a computable function on the real numbers, under 
the Polish approach). He also did not say explicitly that his algorithm to com- 
pute / is meant to be uniform, but this seems to be implied by his subsequent 
observation: 

A function cannot be computable, if it is not continuous at all com- 
putable values of the variable. 

A footnote to this observation then indicates that he had the Polish approach 
in mindU 

In order to make the computation of a function effectively possible 
with a given precision, one additionally needs to know the modulus 
of continuity of the function, which is the [...] relation [...] between 
the variation of the function values with respect to the variation of 
the variable. 

Borel went on to discuss different types of discontinuous functions, including 
those that we now call Borel measurable. In fact, the entire discussion of com- 
putable real numbers and computable real functions is preliminary to Borel's 
development of measure theory, and the discussion was meant to motivate as- 
pects of that development. 

3 Hence Borel's result that computability implies continuity cannot be seen as an early 
version of the famous theorem of CeTtin, in contrast to what is suggested in [215| . The 
aforementioned theorem states that any Markov computable function is already effectively 
continuous on computable inputs and hence computable in Borel's sense, see sections [23] and 
[3] Figure[5l While this is a deep result, the observation that computable functions in Borel's 
sense are continuous is obvious. 
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2.3 Turing on computable analysis 



Turing's landmark 1936 paper [208] is titled "On computable numbers, with an 
application to the Entscheidungsproblem." It begins as follows: 

The "computable" numbers may be described briefly as the real 
numbers whose expressions as a decimal are calculable by finite 
means. Although the subject of this paper is ostensibly the com- 
putable numbers, it is almost equally easy to define and investigate 
computable functions of an integrable variable or a real or com- 
putable variable, computable predicates, and so forth. The funda- 
mental problems involved are, however, the same in each case, and 
I have chosen the computable numbers for explicit treatment as in- 
volving the least cumbrous technique. I hope shortly to give an 
account of the relations of the computable numbers, functions, and 
so forth to one another. This will include a development of the the- 
ory of functions of a real variable expressed in terms of computable 
numbers. According to my definition, a number is computable if its 
decimal can be written down by a machine. 

At least two things are striking about these opening words. The first is 
that Turing chose not to motivate his notion of computation in terms of the 
ability to characterize the notion of a computable function from N to N, or 
the notion of a computable set of natural numbers, as in most contemporary 
presentations; but, rather, in terms of the ability to characterize the notion 
of a computable real number. In fact, Section 10 is titled "Examples of large 
classes of numbers which are computable." There, he introduced the notion of a 
computable function on computable real numbers, and the notion of computable 
convergence for a sequence of computable numbers, and so on; and argued that, 
for example, e and n and the real zeros of the Bessel functions are computable. 
The second striking fact is that he also flagged his intention of developing a 
full-blown theory of computable real analysis. As it turns out, this was a task 
that ultimately fell to his successors, as we explain below. 

The precise definition of a computable real number given by Turing in the 
original paper can be expressed as follows: a real number r is computable if there 
is a computable sequence of 0's and l's with the property that the fractional 
part of r is equal to the real number obtained by prefixing that sequence with 
a binary point. There is a slight problem with this definition, however, which 
Turing discussed in a later correction 209 , published in 1937. Suppose we have 
a procedure that, for every i, outputs a rational number qi with the property 
that 

\r- qi \ <2"\ 

Intuitively, in that case, we would also want to consider r to be a computable 
real number, because we can compute it to any desired accuracy. In fact, it is not 
hard to show that this second definition coincides with the first: a real number 
has a computable binary expansion if and only if it is possible to compute 
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it in the second sense. In other words, the two definitions are extcnsionally 
equivalent. 

The problem, however, is that it is not possible to pass uniformly between 
these two representations, in a computable way. For example, suppose a pro- 
cedure of the second type begins to output the sequence of approximations 
1/2,1/2,1/2,.... Then it is difficult to determine what the first binary digit 
is, because at some point the output could jump just above or just below 1/2. 
This intuition can be made precise: allowing the sequence in general to depend 
on the halting behavior of a Turing machine, one can show that there is no al- 
gorithmic procedure which, given a description of a Turing machine describing 
a real number by a sequence of rational approximations, computes the digits 
of r. On the other hand, for any fixed description of the first sort, there is a 
computable description of the second sort: either r is a dyadic rational, which is 
to say, it has a finite binary expansion; or waiting long enough will always pro- 
vide enough information to determine the digits. So the difference only shows 
up when one wishes to talk about computations which take descriptions of real 
numbers as input. In that case, as Turing noted, the second type of definition 
is more natural; and, as we will see below, these are the descriptions of the 
computable reals that form the basis for computable analysis. 

Turing's second representation of computable reals, presented in his correc- 
tion |209j . is given by the formula 

oo 

(2i - l)n + ^(2cr - 1) 

r=l 

where i and n provide the integer part of the represented number and the binary 
sequence c r the fractional part. This representation is essentially what has later 
been called a signed-digit representation with base |. It is interesting that 
Turing acknowledged Brouwer's influence (see also |72j): 

This use of overlapping intervals for the definition of real numbers 
is due originally to Brouwer. 

In the 1936 paper, in addition to discussing individual computable real num- 
bers, Turing also defined the notion of a computable function on real numbers. 
Like Borel, he adopted the standpoint that the input to such a function needs 
to be computable itself: 

We cannot define general computable functions of a real variable, 
since there is no general method of describing a real number, but we 
can define a computable function of a computable variable. 

A few years later Turing introduced oracle machines [2 11) , which would have 
allowed him to handle computable functions on arbitrary real inputs, simply by 
considering the input as an oracle given from outside and not as being itself 
computed in some specific way. But the 1936 definition ran as follows. First, 
Turing extended his definition of computable real numbers 7„ from the unit 
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interval to all real numbers using the formula a n = tan(-7r(7„ — 5)). He went 
on: 

Now let <p(n) be a computable function which can be shown to be 
such that for any satisfactorily] argument its value is satisfactory. 
Then the function /, defined by f(a n ) = a v r n ), is a computable 
function and all computable functions of a computable variable are 
expressible in this form. 

Hence, Turing's definition of a computable function on the real numbers 
essentially coincides with Markov's definition, which would later form the ba- 
sis of the Russian approach to computable analysis (see sections 12.61 and 13. 5[) . 
Computability of / is not defined in terms of approximations, but in terms of 
functions ip that transfer algorithms that describe the input into algorithms that 
describe the output. There are at least three objections against the technical de- 
tails of Turing's definition, some of which have already been addressed by Guido 
Gherardi in [72], which is the first careful discussion of Turing's contributions 
to computable analysis: 

1 . Turing used the binary representation of real numbers in order to define j n , 
and hence the derived notion of a computable real function is somewhat 
unnatural and restrictive. However, we can consider this problem as being 
repaired by Turing's second approach to computable reals in [209] . which 
we have described above. 

2. Turing should have mentioned that the function / is only well-defined 
by f(a n ) — a,pt n ) if <p is extensional in the sense that a n = ak implies 
a ip(n) = cWfcV However, we can safely assume that this is what he had 
in mind, since otherwise his equation does not even define a single- valued 
function. 

3. Turing did not allow arbitrary suitable computable functions tp, but he 
restricted his definition to total functions ip. However, it is an easy exer- 
cise to show that for any partial computable p that maps all satisfactory 
numbers to satisfactory numbers there is a total computable function ip 
such that ip(n) and p(n) are numbers of the same total computable func- 
tion for all satisfactory inputs n. Hence the restriction to total tp is not 
an actual restriction. 

These three considerations support our claim that Turing's definition of a com- 
putable real function is essentially the same as Markov's definition. 

It is worth pointing out that Turing introduced another important theme 
in computable analysis, namely, developing computable versions of theorems of 
analysis. For instance, he pointed out: 

4 Turing calls a natural number n satisfactory if, in modern terms, n is a Godel index of a 
total computable function. 
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... we cannot say that a computable bounded increasing sequence of 
computable numbers has a computable limit. 

This shows that Turing was aware of the fact that the Monotone Convergence 
Theorem cannot be proved computably. He did, however, provide a (weak) 
computable version of the Intermediate Value Theorem: 

(vi) If a and /3 are computable and a < (3 and ip(ct) < < <p(/3), 
where <p(a) is a computable increasing continuous function, 
then there is a unique computable number 7, satisfying a < 
7 < (3 and 99(7) = 0. 

The fact that the limit of a computable sequence of computable real numbers 
need not be computable motivates the introduction of the concept of computable 
convergence: 

We shall say that a sequence /3„ of computable numbers converges 
computably if there is a computable integral valued function N(e) of 
the computable variable e, such that we can show that, if e > and 
n > N(e) and m > N(e), then \/3 n — (3 m \ < e. We can then show 
that 

(vii) A power series whose coefficients form a computable sequence 
of computable numbers is computably convergent at all com- 
putable points in the interior of its interval of convergence. 

(viii) The limit of a computably convergent sequence is computable. 

And with the obvious definition of "uniformly computably conver- 
gent" : 

(ix) The limit of a uniformly computably convergent computable 
sequence of computable functions is a computable function. 
Hence 

(x) The sum of a power series whose coefficients form a computable 
sequence is a computable function in the interior of its interval 
of convergence. 

From (viii) and ir — 4(1 — | + 1 — ...) we deduce that ir is computable. 
From e = 1 + 1 + + ^ + ... we deduce that e is computable. From 
(vi) we deduce that all real algebraic numbers are computable. From 
(vi) and (x) we deduce that the real zeros of the Bessel functions are 
computable. 

Here we need to warn the reader that Turing erred in (x): the mere com- 
putability of the sequences of coefficients of a power series does not guarantee 
computable convergence on the whole interior of its interval of convergence, but 
only on each compact subinterval thereof (see Theorem 7 in |174j and Exer- 
cise 6.5.2 in [224] ). 
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Before we leave Turing behind, some of his other, related work is worth 
mentioning. For example, in 1938, Turing published an article, "Finite Approx- 
imations to Lie Groups" [210] . in the Annals of Mathematics, addressing the 
question as to which infinite Lie groups can be approximated by finite groups. 
The idea of approximating infinitary mathematical objects by finite ones is cen- 
tral to computable analysis. The problem that Turing considered here is more 
restrictive in that he required that the approximants be groups themselves, and 
the paper is not in any way directly related to computability; but the paral- 
lel is interesting. What is more directly relevant is the fact that Turing was 
not only interested in computing real numbers and functions in theory, but in 
practice. The Turing archival contains a sketch of a proposal, in 1939, to build 
an analog computer that would calculate approximate values for the Riemann 
zeta-function on the critical line. His ingenious method was published in 1943 
|212j . and in a paper published in 1953 |214j he described calculations that 
were carried out in 1950 on the Manchester University Mark 1 Electronic Com- 
puter. Finally, we mention that, in a paper of 1948 |213j . Turing studied linear 
equations from the perspective of algebraic complexity. There he introduced 
the concepts of a condition number and of ill-conditioned equations, which are 
widely used in numerical mathematics nowadays. (For a discussion of this work, 
see Lenore Blum's contribution to this collection, as well as [75].) In some very 
interesting unpublished work Turing provided an algorithm to compute an ab- 
solutely normal real number (see the discussion by Becher et. al. in [T3]) and in 
this way he even made an early contribution to algorithmic randomness. 

2.4 Prom Turing to Specker 

Turing's ideas on computable real numbers where taken up in 1949 by Ernst 
Specker [201] . who completed his Ph.D. in Zurich under Hopf in the same year. 
Specker was probably attracted to logic by Paul Bernays, whom he thanks in 
that paper. Specker constructed a computable monotone bounded sequence 
(x n ) whose limit is not a computable real number, thus establishing Turing's 
claim. In modern terms one can easily describe such a sequence: given an 
injective computable enumeration / : N — > N of a computably enumerable but 
non-computable set K C N, the partial sums 

n 
t=0 

form a computable sequence without a computable limit. Such sequences are 
now sometimes called Specker sequences. 

It is less well-known that Specker also studied different representations of real 
numbers in that paper, including the representation by computably converging 
Cauchy sequences, the decimal representation, and the Dedekind cut represen- 
tation (via characteristic functions). In particular, he considered both prim- 
itive recursive and general computable representations in all three instances. 

5 See AMT/C/2 in http://www.turingarchive.Org/browse.php/C 
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His main results include the fact that the class of real numbers with primi- 
tive recursive Dedekind cuts is strictly included in the class of real numbers 
with primitive recursive decimal expansions, and the fact that the latter class 
is strictly included in the class of real numbers with primitive recursive Cauchy 
approximation. He also studied arithmetic properties of these classes of real 
numbers. 

Soon after Specker published his paper, Rosza Peter included Specker's dis- 
cussion of the different classes of primitive recursive real numbers in her book, 
Rekursive Funktionen [173] . In a review of that book |184) . Raphael M. Robin- 
son noted that all the representations of real numbers mentioned above yield 
the same class of numbers if one uses computability in place of primitive recur- 
siveness. However, this is no longer true if one moves from single real numbers 
to sequences of real numbers. Mostowski [161] proved that for computable se- 
quences of real numbers the analogous strict inclusions hold, as they were proved 
by Specker for single primitive recursive real numbers: the class of sequences of 
real numbers with uniformly decidable Dedekind cuts is strictly included in the 
class of sequences of real numbers with uniformly computable decimal expan- 
sion, which in turn is strictly included in the class of sequences of real numbers 
with computable Cauchy approximations. Mostowski pointed out that the strik- 
ing coincidence between the behavior of classes of single primitive recursive real 
numbers and computable sequences of real numbers is not very well understood: 

It remains an open problem whether this is a coincidence or a special 
case of a general phenomenon whose cause ought to be discovered. 

A few years after Specker and Robinson, the logician H. Gordon Rice [183j 
proved what Borel had already observed informally, namely that equality is 
not decidable for computable real numbers. In contrast, whether a < b or 
a > b holds can be decided for two non-equal computable real numbers a, b. 
Rice also proved that the set R c of computable real numbers forms a real alge- 
braically closed field, and that the Bolzano- Weierstrafi Theorem does not hold 
computably, in the following sense: there is a bounded c.e. set of computable 
numbers without a computable accumulation point. Kreisel wrote a review of 
Rice's paper for the Mathematical Reviews and pointed out that he had already 
proved results that are more general than Rice's; namely, he proved that any 
power series with a computable sequence of coefficients (that are not all identi- 
cal to zero) has only computable zeros in its interval of convergence )121j . and 
that there exists a computable bounded set of rational numbers which contains 
no computable subsequence with a computable modulus of convergence [120 . 

In a second paper on computable analysis [202] , Specker constructed a com- 
putable real function that attains its maximum in the unit interval at a non- 
computable number. In a footnote he mentioned that this solves an open prob- 
lem posed by Grzegorczyk [52] , a problem that Lacombe had already solved 
independently |137] . Specker mentioned that a similar result can be derived 
from a theorem of Zaslavskii |239j . He also mentioned that the Intermediate 
Value Theorem has a (non-uniform) computable version, but that it fails for 
sequences of functions. 
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2.5 Computable analysis in Poland and France 

In the 1950's, computable analysis received a substantial number of contribu- 
tions from Andrzej Grzegorczyk 82, 83, 84, 85 in Poland and Daniel Lacombe 

^MMM\^W^\IMMIi\^\^M&^M^M^M in France. 
Grzegorczyk is the best known representative of the Polish school of computable 
analysis, a school of research that came into being soon after Turing published 
his seminal paper. Apparently, members of the Polish school of functional anal- 
ysis became interested in questions of computability, and Stefan Banach and 
Stanislaw Mazur gave a seminar talk on this topic in Lviv (now in Ukraine, at 
that time Polish) on January 23, 1937 [TT]. Unfortunately, the second world 
war got in the way, and Mazur's course notes on computable analysis (edited 
by Grzegorczyk and Rasiowa) where not published until much later [1 54] . 

Banach and Mazur's notion of computability for functions on the real num- 
bers was defined using computable sequences. Accordingly, we call a function 
/ : R c — > R c Banach-Mazur computable if it maps computable sequences to 
computable sequences. While this concept works well for well-behaved func- 
tions (see, for example, the discussion of Pour-El and Richards' work on linear 
functions in Section |2.6[) . it yields a weaker concept of computability of real 
number functions in general, as proved later by Aberth (again, see Section |2"1?)) . 
One of Mazur's theorems was that every Banach-Mazur computable function 
is already continuous on the computable reals. This theorem can be seen as a 
forerunner and even as a stronger version of a theorem of Kreisel, Lacombe, and 
Shoenfield 123 , which was independently proved in slightly different versions 
by Ceitin [36] and later by Moschovakis [160) . Essentially, the theorem says 
that every real number function that is computable in the sense of Markov is 
continuous (on the computable reals). (In this particular case one even has the 
stronger conclusion that the function is effectively continuous. See Figure [5] in 
Section HH) 

Grzegorczyk's work [84j was mostly concerned with computable functions 
/ : [0, 1] — > R on all real numbers of the unit interval and he proved a number 
of equivalent characterizations of this concept. For instance / is computable if 
and only if each of the following conditions hold: 

1. / maps computable sequences to computable sequences and it has a com- 
putable modulus of uniform convergence, 

2. / can be represented by a computable function h on rational intervals 
that is monotone with respect to inclusion and that approximates / in the 
sense that {f(x)} = f]{h(I) '■ x £ 1} for all x e [0, 1]. 

Later on, the first characterization became the basic definition of computability 
used by Pour-El and Richards (see section l2T6"|) . whereas the second characteriza- 
tion is the starting point for domain theoretic characterizations of computability 

[MM- 

Lacombe's pioneering work opened up two research directions in computable 
analysis which were to play an important role. First, he initiated the study of 
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c.e. open sets of reals, which are open sets that can be effectively obtained 
as unions of rational intervals |139j . Together with Kreisel, Lacombe proved 
that there are c.e. open sets that contain all computable reals and that have 
arbitrarily small Lebesgue measure [124] . Secondly, Lacombe anticipated the 
study of computability on more abstract spaces than Euclidean space, and was 
one of the first to define the notion of a computable metric space |143j . 

2.6 Computable analysis in North America 

More comprehensive approaches to computable analysis were developed in the 
United States during the 1970's and 1980's. One stream of results is due to 
Oliver Aberth, and is accumulated in his book, Computable Analysis [2]. His 
work is built on Markov's notion of computability for real number functions. We 
call a function / : R c — 5- M c on computable reals Markov computable if there is 
an algorithm that transforms algorithms that describe inputs x into algorithms 
that describe outputs f(x). Among other results, Aberth provided an example 
of a Markov computable function / : R c — > R c that cannot be extended to a 
computable function on all real numbers [2j Theorem 7.3]. Such a function / 
can, for instance, be constructed with a Specker sequence (x n ), which is a com- 
putable monotone increasing bounded sequence of real numbers that converges 
to a non-computable real x. Using effectively inseparable sets, such a Specker 
sequence can even be constructed so that it is effectively bounded away from 
any computable point (see, for instance, the Effective Modulus Lemma in [176 ). 
One can use such a special Specker sequence and place increasingly steeper peaks 
on each Figure [T] If adjusted appropriately, this construction leads 




Figure 1: Construction of a function / : R — > R 



to a Markov computable function / : R c — > R c that cannot be extended to a 
computable function on the whole of R. In fact, it cannot even be extended 
to a continuous function on R, since the peaks accumulate at x. Pour-El and 
Richards [1 761 Theorem 6, page 67] later refined this counterexample to obtain 
a diffcrcntiable, non-computable function / : R — > R whose restriction to the 
computable reals R c is Markov computable. This can be achieved by replacing 
the peaks in Aberth's construction by smoother peaks of decreasing height. 

Another well-known counterexample due to Aberth [T] is the construction of 
a function F : R^ — >• R c that is Markov computable and uniformly continuous 
on a rectangle centered around the origin, with the property that the differential 
equation 

y'(x) = f(x,y(x)) 
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with initial condition y(0) = has no computable solution y, not even on an 
arbitrarily small interval around the origin. This result can be extended to 
computable functions / : M 2 R, which was proved somewhat later by Pour- 
El and Richards 177 . It can be interpreted as saying that the Peano Existence 
Theorem does not hold computably. 

Marian Pour-El and J. Ian Richards' approach to computable analysis was 
the dominant approach in the 1980's, and is nicely presented in their book |176j . 
The basic idea is to axiomatize computability structures on Banach spaces using 
computable sequences and a characterization of computability that is derived 
from Grzegorczyk's first characterization, which was mentioned in section 12.51 
This approach is tailor-made for well-behaved maps such as linear ones. One 
of their central results is their First Main Theorem |179j . which states that a 
linear map T : X — > Y on computable Banach spaces is computable if and only 
if it is continuous and has a c.e. closed graph, i.e. 

T computable T continuous and graph(T) c.e. closed. 

Moreover, the theorem states that a linear function T with a c.e. closed graph 
satisfies 

T computable <=> T maps computable inputs to computable outputs. 

Here a closed set is called c.e. closed if it contains a computable dense sequence. 
The crucial assumption that the graph(T) is c.e. closed has sometimes been 
ignored tacitly in presentations of this theorem and this has created the wrong 
impression that the theorem identifies computability with continuity. In any 
case, the theorem is an important tool and provides many interesting coun- 
terexamples. This is because the contrapositive version of the theorem implies 
that every linear discontinuous T with a c.e. closed graph maps some computable 
input to a non-computable output. For instance, the operator of differentiation 

d:CC[0,l]4C[0,l],/4/' 

is known to be linear and discontinuous and it is easily seen to have a c.e. closed 
graph (for instance the rational polynomials are easily differentiated) and hence 
it follows that there is a computable and continuously differentiable real number 
function / : [0, 1] — > K whose derivative /' is not computable. This is a fact 
that has also been proved directly by Myhill 163 . 

The First Main Theorem is applicable to many other operators that arise 
naturally in physics, such as the wave operator [178] 1175] . In particular, it 
follows that there is a computable wave (x,y,z) i— > u(x, y, z, 0) that transform 
into a non-computable wave (x,y,z) H ¥ u(x, y, z,l) if it evolves for one time 
unit according to the wave equation 

d 2 u d 2 u d 2 u d 2 u 
dx 2 dy 2 dz 2 dt 2 

from initial condition ^ = 0. This result has occasionally been interpreted 
as suggesting that it might be possible to build a wave computer that would 
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violate Church's Thesis. However, a physically more appropriate analysis shows 
that this is not plausible (see Section [2~7l and also [172] ). Pour-El and Richards' 
definitions and concepts regarding Banach spaces have been transferred to the 
more general setting of metric spaces by Yasugi, Mori and Tsujii |235j in Japan. 

Another stream of research on computable analysis in the United States came 
from Anil Nerode and his students and collaborators. For instance, Metakides, 
Nerode and Shore proved that the Hahn-Banach Theorem is not computably 
valid, but admits a non-uniform computable solution in the finite dimensional 
case |157U158j . Joseph S. Miller studied sets that appear as the set of fixed points 
of a computable self map of the unit ball in Euclidean space [159] . Among other 
things he proved that a closed set A C [0, l] n is the set of fixed points of a com- 
putable function / : [0, 1]" — > [0, 1]" if and only if it is co-c.e. closed and has 
a co-c.e. closed connectedness component. Zhou also studied computable open 
and closed sets on Euclidean space |242j . In a series of papers |103l 11041 1102] , 
Kalantari and Welch provided results that shed further light on the relation be- 
tween Markov computable functions and computable functions. Douglas Cenzer 
and Jeffrey B. Remmel have studied the complexity of index sets for problems 
in analysis [33 [38l [39] . McNicholl and his co-authors have started a study of 
computable complex analysis [1561 11551 11521 1153] . 

It is possible to study computable functions of analysis from the point of 
view of computational complexity. We cannot discuss such results in detail 
here, but in passing we mention the important work of Harvey Friedman and 
Ker-I Ko [III OH HHl EH El QUO, which is well summarized in [TTB] : and, 
more recently, interesting work of Mark Braverman, Stephen Cook and Akitoshi 
Kawamura [28l [27l [29l 11051 I106j . Braverman and Michael Yampolsky have 
recently written a nice monograph on computability properties of Julia sets 

G23- 

2.7 Computable analysis in Germany 

In Eastern Germany, Jurgen Hauck produced a sequence of interesting papers on 
computable analysis that were only published in German [86, ; 8TJ [88] , building 
on the work of Dieter Klaua [1071 1108 . The novel aspect of Hauck's work is that 
he studied the notion of a representation (of real numbers or other objects) in 
its own right. This aspect has been taken up in the work of Weihrauch's school 
of computable analysis [224] . which has its origins in the work of Christoph 
Kreitz and Klaus Weihrauch [123 HH HH [222] . Kreitz and Weihrauch began 
to develop a systematic theory of representations that is centered around the 
notion of an admissible representation. Representations Sx -Q N N — > X are 
(potentially partial) maps that are surjective. Admissible representations Sx are 
particularly well-behaved with respect to some topology given on the represented 
set X. If Sx and Sy are admissible representations of topological spaces X and 
Y, respectively, then a (partial) function / :C X — > Y is continuous if and only 
if there exists a continuous F :C N N — > N N on Baire space N N such that the 
diagram in Figure [2] commutes. 

The perspective arising from the concept of a representation as a map sheds 
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Figure 2: Notions of computable real number functions 



new light on representations of real numbers as they were studied earlier by 
Specker, Mostowski and others. Their results can be developed in a more 
uniform way, using the concept of computable reducibility for representations, 
which yields a lattice of real number representations. The position of a repre- 
sentation in this lattice characterizes the finitely accessible information content 
of the respective representation, see Figure [3J In particular, the behavior of 
representations that was mysterious to Mostowski can be explained naturally in 
this way. 



Naive Cauchy representation 

Enumeration Enumeration 
P< P> 
of left cuts of right cuts 

Cauchy representation p = p < n p> 
t 

Representation ph for basis b 

Characteristic functions Characteristic functions 

of left cuts of right cuts 

Continued fraction representation 



Figure 3: The lattice of real number representations 



The concept of an admissible representation has been extended to a larger 
classes of topological spaces by the work of Matthias Schroder |193j . In his 
definition, an admissible representation with respect to a certain topology on 
the represented space is just one that is maximal among all continuous repre- 
sentations with respect to computable reducibility. Hence, admissibility can be 
seen as a completeness property. 
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Many other interesting results have been developed using the concept of 
a representation (see, for instance, the tutorial [25] for a more detailed sur- 
vey). For instance, Peter Hertling jM] proved that the real number structure 
is computably categorical in the sense that up to computable equivalence there 
is one and only one representation that makes all the ingredients of the struc- 
ture computable. He also studied a computable version of Riemann's Mapping 
Theorem [90], and proved that the Hyperbolicity Conjecture implies that the 
Mandelbrot set M C R 2 is computable in the sense that its distance function is 
computable. Finally, he also proved that there is a Banach-Mazur computable 
function / : M c — > K c that is not Markov computable [52] ■ 

Matthias Schroder and Klaus Weihrauch have studied a very general setting 
for computational complexity in analysis using proper representations [1901 12251 
194 and Schroder has also studied online complexity of real number functions 
[1911 1192] . following earlier work of Schdnhage [189] . Robert Rettinger and 
Klaus Weihrauch studied the computational complexity of Julia sets [180] . In 
another interesting stream of papers weaker notions of computability on real 
numbers were studied [2401 11811 1182) by Xizhong Zheng, Klaus Weihrauch and 
Robert Rettinger. 

Ning Zhong and Klaus Weihrauch studied computability of distributions 
|241[ and proved computability of the solution operator of several partial dif- 
ferential equations [1211 H2E1 H21 dUl [HI] • In particular, they revised the the 
wave equation and proved that if this equation is treated in physically realistic 
spaces, then it turns out to be computable [227] . 

Martin Ziegler has studied models of hypercomputation and related notions 
of computability for discontinuous functions |243j . Martin Ziegler and Stephane 
Le Roux have revisited co-c.e. closed sets. Among other things, they proved that 
the Bolzano- Weierstrafi Theorem is not limit computable, in the sense that there 
is a computable sequence in the unit cube without a limit computable cluster 
point [144 . Baigger [10] proved that the Brouwer Fixed Point Theorem has 
a computable counterexample (essentially following Orevkov's ideas |167j . who 
proved the corresponding result for Markov computability much earlier). 

Just as the computational content of representations of real numbers can 
be analyzed in a lattice, there is another lattice that allows one to classify 
the computational content of (forall-exists) theorems. This lattice is based 
on a computable reduction for functions that has been introduced by Klaus 
Weihrauch and that has recently been studied more intensively by Arno Pauly, 
Guido Gherardi, Alberto Marcone, Brattka, and others [1711 [23 ] [21] . Theorems 
such as the Hahn-Banach Theorem [75], the Baire Category Theorem, the In- 
termediate Value Theorem [22], the Bolzano- Weierstrafi Theorem [24] and the 
Radon-Nikodym Theorem [100] have been classified in this lattice as well as 
finding Nash equilibria or solving linear equations [170] . Figure @] shows the 
respective relative positions of these theorems in the Weihrauch lattice. This 
classification yields a uniform perspective on the computational content of all 
these theorems, and the position of a theorem in this lattice fully characterizes 
certain properties of the respective theorem regarding computability. In partic- 
ular, it implies all aforementioned computability properties of these theorems. 
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Figure 4: The computational content of theorems in the Weihrauch lattice 



2.8 Computable measure theory 

The field of algorithmic randomness relies on effective notions of measure and 
measurable set. Historically, debates in the 1930's as to how to make sense of the 
notion of a "random" sequence of coin flips gave rise to the Kolmogorov axioms 
for a probability space, and the measure-theoretic interpretation of probability. 
On that interpretation, the statement that a "random" element of a space has 
a certain property should be interpreted as the statement that the set of coun- 
terexamples has measure 0. In that framework, talk of random elements is only 
a manner of speaking; there is no truly "random" sequence of coin flips x, since 
each such clement is contained in the null set {x}. 

In the 1960's, Per Martin-L6f, a student of Kolmogorov, revived the no- 
tion of a random sequence of coin flips |149[ I150j . The idea is that, from a 
computability-theoretic perspective, it makes sense to call a sequence random 
if it avoids all suitably effective null sets. For example, a Martin-Ldf test is an 
effectively null Gs set, which is to say, an effective sequence of computably open 
sets Gi with the property that for each i, the Lebesgue measure of Gi is less 
than 2~\ An sequence of coin flips fails such a test if it is in the intersection of 
the sets Gi. An element of the unit interval is said to be Martin-Ldf random if 
it does not fail any such test. Osvald Demuth [48j defined an equivalent notion, 
independently, in 1975. 

Martin-L6f randomness can be given alternate characterizations in terms of 
computable betting strategies, or in terms of information content. There are 
by now many other notions of randomness in the literature. One, based on the 
influential work of C.-P. Schnorr [188, 187], is similar to Martin-L6f randomness 
but requires that, moreover, the measures of the open sets Gi are uniformly 
computable. It is impossible to even begin to survey this very active area here, 
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but detailed overviews can be found in the monographs [1651 [50] . Kucera and 
Nies |135j provides a nice discussion of Demuth's contributions to the subject. 

Measure theory, more generally, has been studied from a computability- 
theoretic perspective. Most work in algorithmic randomness has focused on 
random elements of 2 N , or, equivalently, random elements of the unit inter- 
val [0,1]. But there are a number of ways of developing the notion of a com- 
putable measure on more general spaces; see the work of Gacs [66] , Hertling and 
Weihrauch [93] and Hoyrup and Rojas [OS]. Weihrauch |223j and Schroder [195j 
have discussed different representations of probability measures and Weihrauch 
and Wu [234] have further developed the foundations of computable measure 
theory. Edalat [53J has studied computable measure and integration theory 
from the perspective of domain theory. Bosserhoff [20] and Hoyrup and Ro- 
jas [98] have defined various notions of computability for measurable functions. 
This makes it possible to consider results in measure theory, probability, and 
dynamical systems in computability-theoretic terms. 

For example, computable aspects of the ergodic theorems have been stud- 
ied extensively. The pointwise and mean ergodic theorems assert that if T is 
a measure-preserving transformation of a finite measure space and / is any 
measurable function, then the ergodic averages A n f = i Yui<n f °T l converge 
pointwise, and in the L 2 norm, respectively. V'yugin [2201 1221) and, indepen- 
dently, Avigad and Simic [9] 11971 [SJ have shown that the mean and pointwise 
ergodic theorems do not hold computably, but Avigad, Gerhardy, and Townser 
[8] have shown that a rate of convergence can be computed from T, /, and 
the norm of the limit. In particular, the rate of convergence can be computed 
from T and / when the transformation is ergodic. V'yugin [220, 221. has shown 
that if T is a measurable transformation of the unit interval and / is a com- 
putable function, then the sequence of ergodic averages (A n f) converges on 
every Martin-L6f random real. Franklin and Towsner |60] have recently shown 
that this is sharp. Hoyrup and Rojas [69] have generalized V'yugin's result to 
computable measurable functions. Indeed, the work of Gacs, Galatolo, Hoyrup, 
and Rojas 68, 67 , together with the results of [5], shows that the Schnorr ran- 
dom reals are exactly the ones at which the ergodic theorem holds with respect 
to every ergodic measure and computable measure-preserving transformation 
(see also the discussion in |166| ). Bienvenu, Day, Hoyrup, and Shen [15] and, 
independently, Franklin, Greenberg, Miller, and Ng [59] have extended V'yugin's 
result to the characteristic function of a computably open set, in the case where 
the measure is ergodic. 

Other theorems of measure-theory and measure-theoretic probability have 
been considered in this vein. (Such work is, to some extent, foreshadowed by 
the study of measure theory in the context of reverse mathematics, by Simpson, 
Yu, and others; see, for example, |2381 12361 1237] .) The Lebesgue differentiation 
theorem states, roughly speaking, that for almost every point of M n , the value of 
an integrable function at that point is equal to the limit of increasingly small av- 
erages taken around that point. Pathak |169] showed that, when the integrable 
function is computable in an appropriate sense, the theorem is true at every 
Martin-L6f random point. Pathak, Rojas, and Simpson [166] later strengthened 
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the conclusion to the Schnorr-random real numbers, and showed that this is 
sharp; this same result was obtained independently by Rute [186] . Another the- 
orem of Lebesgue's states that a function of bounded variation is differentiable 
almost everywhere. Recasting results of Demuth [35], Brattka, Miller, and Nies 
[21)] have shown that when the function in question is computable, the result 
holds at Martin-L6f random real numbers, and that this is sharp. Moreover, 
they obtained clean characterizations of other notions of randomness in terms 
of differentiability. Freer, Kjos-Hansen, Nies, and Stephan \5T\ have obtained 
similar results for the class of Lipschitz functions. Rute [186] has considered a 
number of differentiability and martingale convergence theorems, characterizing 
the points at which the theorems hold, and determining when rates of conver- 
gence are and are not computable. Ackerman, Freer, and Roy have studied the 
computability of conditional distributions 3 , Hoyrup, Rojas, and Weihrauch 
have studied the computability of Radon- Nikodym derivatives [55], and Freer 
and Roy have provided a computable version of de Finetti's theorem [52"] . 

2.9 Appendix: Notions of computability for real number 
functions 

In this appendix, we summarize the various notions of computability for real 
number functions that were discussed in previous sections, and we briefly indi- 
cate the logical relations between these notions. For simplicity we restrict the 
discussion to functions that are total on all real numbers or on all computable 
real numbers. 

A rapidly converging Cauchy name for a real number a: is a sequence (xi)ien 
of rationals that converges to x rapidly, i.e. in such a way that for every i and 
j > i, \xj — Xi\ < 2~ l . (Any other computable rate of convergence would work 
equally well.) 

1. / : R — ► R is called computable if there is an algorithm that transforms 
each given rapidly converging Cauchy name of an arbitrary x into such a 
name for f(x) (on a Turing machine with one-way output tape). 

2. / : R c — ► R c is called Borel computable if / is computable in the previously 
described sense, but restricted to computable reals. 

3. / : R c — 5- R c is called Markov computable if there is an algorithm that 
converts each given algorithm for a computable real x into an algorithm 
for f(x). Here an algorithm for a computable real number x produces a 
rapidly converging Cauchy name for x as output. 

4. / : R c — > R c is called Banach-Mazur computable if / maps any given 
computable sequence (x n ) of real numbers into a computable sequence 
( f(x n )) of real numbers. 

The diagram in Figure [5] summarizes the logical relations between different 
notions of computability. Some of these equivalences can be generalized to 
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partial functions with well-behaved domains, and to other computable metric 
spaces (see [89 ) [92 ] ). 
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Figure 5: Notions of computable real number functions 



3 Constructive analysis 

As noted in Section [1] the growing divergence of conceptual and computational 
concerns in the early twentieth century left the mathematical community with 
two choices: either restrict mathematical language and methods to preserve 
direct computational meaning, or allow more expansive language and methods 
and find a place for computational mathematics within it. We have seen how the 
theory of computability has supported the latter option: given broad notions of 
real number, function, space, and so on, we can now say, in precise mathematical 
terms, which real numbers, functions, and spaces are computable. 

There is still, however, a community of mathematicians that favors the first 
option, namely, adopting a "constructive" style of mathematics, whereby theo- 
rems maintain a direct computational interpretation. This simplifies language: 
rather than say "there exists a computable function /" one can say "there exists 
a function /," with the general understanding that, from a constructive stand- 
point, proving the existence of such a function requires providing an algorithm 
to compute it. Similarly, rather than say "there is an algorithm which, given x, 
computes y such that . . . ," one can say "given x, there exists y such that ..." 
with the general understanding that a constructive proof, by its very nature, 
provides such an algorithm. 

In sum, from a constructive point of view, all existence claims are expected to 
have computational significance. We will see, however, that this general stricture 
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is subject to interpretation, especially when it comes to infinitary mathematical 
objects. Today there are many different "flavors" of constructive mathematics, 
with varying norms and computational interpretations. Here the methods of 
logic and computability theory prove to be useful yet again: once we design a 
formal deductive system that reasonably captures a style of constructive rea- 
soning, we can provide a precise computational semantics. This serves to clarify 
the sense in which a style of constructive mathematics has computational sig- 
nificance, and helps bridge the two approaches to preserving computational 
meaning. 

This section will provide a brief historical overview of a some of the differ- 
ent approaches to constructive mathematics, as well as some associated formal 
axiomatic systems and their computational semantics. Troelstra and van Dalcn 
[207) and Beeson [14] provide more thorough introductions to these topics. 

3.1 Kroneckerian mathematics 

In 1887, facing the rising Dedekind-Cantor revolution, the mathematician Leopold 
Kronecker wrote an essay in which he urged the mathematical community to 
maintain focus on symbolic and algorithmic aspects of mathematics. 

[A] 11 the results of the profoundest mathematical research must in 
the end be expressible in the simple forms of the properties of in- 
tegers. But to let these forms appear simply, one needs above all 
a suitable, surveyable manner of expression and representation for 
the numbers themselves. The human spirit has been working on this 
project persistently and laboriously since the greyest prehistory. . . . 
(H2B page 955]) 

One gets a better sense of Kronecker's views by considering the way they played 
out in his work, such as his treatment of algebraic numbers in his landmark 
Grundziige einer arithmetischen Theorie der algebraischen Grossen 128 , or in 
his treatment of the fundamental theorem of algebra in Ein Fundamentals atz 
der allgemeinen Arithmetik [129]. In general, Kronecker avoided speaking of 
"arbitrary" real numbers and functions, and, rather, dealt with algebraic sys- 
tems of such things, wherein the objects are given by symbolic expressions, and 
operations on the objects are described in terms of symbolic calculations. Of 
course, it is still an important task to understand how rational approximations 
can be obtained from an algebraic description of a real number. But even when 
dealing with convergent sequences and limits (as in, for example, the proof 
of Dirichlet's theorem on primes in an arithmetic progression in his lectures 
on number theory), Kronecker always provided explicit information as to how 
quickly a sequence converges. The approach is nicely described by his student 
Hensel: 

He believed that one can and must in this domain formulate each 
definition in such a way that its applicability to a given quantity can 
be assessed by means of a finite number of tests. Likewise that an 
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existence proof for a quantity is to be regarded as entirely rigorous 
only if it contains a method by which that quantity can really be 
found. ([HE], quoted in Stein [203| P- 250].) 

A number of articles by Harold Edwards (including [55J [SHI EZ]) provide an 
excellent overview of Kronecker's outlook and contributions to mathematics. 

3.2 Brouwerian intuitionism 

During the 1910's, the Dutch mathematician L. E. J. Brouwer advanced a new 
philosophy of mathematics, intuitionism, which made strong pronouncements 
as to the appropriate methods of mathematical reasoning. Brouwer held that 
mathematical objects are constructions of the mind, and that we come to know 
a mathematical theorem only by carrying out a mental construction that enables 
us to see that it is true. In particular, seeing that a statement of the form AAB 
is true involves seeing that A is true, and that B is true; seeing that A — > B 
is true involves having a mental procedure that transforms any construction 
witnessing the truth of A to a construction witnessing the truth of B; seeing 
that -i A is true involves having a procedure that transforms a construction 
witnessing the truth of A to a contradiction (that is, something which, evidently, 
cannot be the case); and seeing that A V B is true requires carrying out a 
mental construction enabling one to see that A is true, or carrying out a mental 
construction to see that B is true; and similarly for statements involving the 
universal and existential quantifiers. This account, developed further by Heyting 
and Kolmogorov, has come to be known as the BHK interpretation. It is not 
hard to see that, according to this interpretation, the law of the excluded middle, 
A V -iA, does not generally hold. For example, if A is the statement of the 
Goldbach conjecture, then we cannot presently assert A V ->A, because we do 
not currently know that the Goldbach conjecture is true, nor do we know that 
it is false. 

There is a strong solopsistic strand to Brouwer's philosophy, in that the 
mathematical knowledge one has is a reflection of one's inner mental life, inde- 
pendent of an external world or other thinkers. He also held that mathematical 
knowledge and thought are independent of language, which he took to be a 
deeply limited and flawed means of communicating mathematical ideas. Intu- 
itionism may seem to have little to do with computation per se, but, as we will 
see below, if one replaces Brouwer's mental constructions and procedures with 
symbolic representations and algorithms, one is left with an essentially compu- 
tational view of mathematics. Thus, Brouwer's mathematical ideas have proved 
to be influential in computer science, despite the different motivations. 

Brouwer was also an influential topologist, and an important part of his 
intuitionist program was to develop an intuitionistically appropriate foundation 
for reasoning about the continuum [321 133] . To that end, he introduced the 
notion of a choice sequence, that is, an "arbitrary" sequence of natural numbers. 
Some choice sequences are generated by a law, such as the sequence 0, 0, 0, . . .. 
At the other extreme, some sequences are "lawless," that is, generated by events 
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that are not predetermined. Brouwer introduced the continuity principle, which, 
roughly speaking, asserts that any function from choice sequences to the natural 
numbers is continuous; in other words, the value of the function at a given choice 
sequence depends only on a finite initial segment of that sequence. (See Troelstra 
and van Dalen [207, 1.4.6] for a precise formulation.) 

Brouwer went on to develop an intuitionistic set theory based on such se- 
quences. A spread, in his terminology, is a tree on N such that each node has at 
least one successor. Brouwer saw this data as a way of specifying a collection of 
choice sequences; that is, a choice sequence is in the spread if and only if every 
initial segment is in the tree. A fan is a spread with the property that every 
node has only finitely many successors. At the risk of clouding some intuitions, 
we can translate these notions to classical terms: a choice sequence is an ar- 
bitrary element of Baire space (the space of functions from N to N under the 
product topology), a spread corresponds to a closed subset of Baire space, and 
a fan corresponds to a compact subset of Baire space. 

In 1927, Brouwer 34 introduced the bar theorem, which, despite the name, 
is essentially an axiomatic principle that provides transfinite induction on well- 
founded trees on N. Roughly speaking, it asserts that any property that holds 
outside and at the leaves of such a well-founded tree, and is preserved in passing 
from all the children of a node to the node itself, holds of every finite sequence. 
An immediate corollary is the fan theorem, which asserts that a well-founded fan 
is finite. Using the fan theorem and the continuity principle, Brouwer showed 
that every function from [0, 1] to K is uniformly continuous. When properties 
of choice sequences are understood in terms of computable predicates on Baire 
space, this formulation of analysis accords quite well with Grzegorczyk's notion 
of computability, as discussed in Section [231 

For more information on intuitionistic mathematics, see [SH [Ml 12071 [3Tj . 
For more on Brouwer and his philosophical views, see [216, 217 . 

3.3 Early formal systems for constructive mathematics 

Now let us consider some formal axiomatic systems that capture such construc- 
tive styles of reasoning. A theory now known as primitive recursive arithmetic 
was designed by Thoralf Skolem [199] to capture Hilbert's notion of "finitary" 
mathematical reasoning. In its strictest form, the theory has variables ranging 
over the natural numbers, but no quantifiers. One starts with some basic func- 
tions on the natural numbers, and is allowed to define new functions using a 
schema of primitive recursion: 

• /(0, zi, . . . , z n ) = g(zi, z n ), and 

• f{x + l,zi,...,z n ) = h{x,f{x,z x ,...,z n ),z x ,...,z n ), 

where g and h have previously been defined. Although this schema may seem 
limited, with ingenuity one can show that almost any reasonable function (and 
hence relation) on the natural numbers is primitive recursive. In particular, 
the primitive recursive relations are closed under bounded quantification. As a 
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result, primitive recursive arithmetic can be viewed as a reasonable framework 
in which to carry out a Kroneckarian constructivism. 

Indeed, primitive recursive arithmetic is often taken as a starting point for 
formalizing basic mathematical notions in many presentations of formal ax- 
iomatic foundations for mathematics, classical or constructive. It plays a cen- 
tral role in Hilbert and Bernays' landmark Grundlagen der Mathematik [96] . 
in Kleene's Introduction to Metamathematics [lllj . and Goodstein's Recursive 
Number Theory [75] ■ Goodstein's later Recursive Analysis [50] develops topics 
like integration and differentiation on the basis of primitive recursive arithmetic. 

Today, finitism is viewed as an extreme form of intuitionism. Foundational 
writings in the 1920's, however, do not show a clear distinction between the two. 
Indeed, as late as 1930, von Neumann used the two terms interchangeably in his 
exposition of formalism [219] at the Second Conference for Epistemology and 
the Exact Sciences in Konigsberg. (This was, incidentally, the meeting where 
Godel announced the first incompleteness theorem.) 

The situation was clarified when Arend Heyting, a student of Brouwer's, pro- 
vided formal axiomatizations of intuitionistic mathematics, despite Brouwer's 
antipathy towards formalization. Specifically, he provided formal treatments of 
intuitionistic logic, arithmetic, and Brouwer's theory of choice sequences, which 
appeared in a series of three papers published in 1930 [05]. In 1933, Godel [76] . 
and later Gentzen [70] , showed that classical first-order arithmetic could be in- 
terpreted in intuitionistic first-order arithmetic via what is now known as the 
double-negation translation. At the time, classical first-order arithmetic, which 
extends primitive recursive arithmetic with quantifiers ranging over the natural 
numbers and induction for all formulas in the language, was viewed as strictly 
stronger than finitism. The interpretation of classical arithmetic in intuitionis- 
tic arithmetic thereby prompted the realization that intuitionism and finitism 
diverge as well. 

3.4 Realizability 

The developments described in this section so far all predate Turing's analysis of 
computability. Once notions of computability were in place, however, it was not 
long before Stephen Kleene showed that they can be used to provide a precise 
sense in which intuitionistic mathematics has a computational interpretation. 
Towards the end of a paper [109j published in 1943, but first presented to the 
American Mathematical Society in 1940, he put forth the following: 

Thesis III. A proposition of the form (x)(Ey)A(x,y) containing no 
free variables is provable constructively, only if there is a general 
recursive function <j>(x) such that (x)A(x, <fi(x)). ( [1091 Section 16, 
page 69]) 

One can turn this thesis into theorems by showing that the statement holds when 
we replace "constructive provability" by provability in particular axiomatic sys- 
tems. Kleene carried out this program over the next few years, together with 
his student, David Nelson [110[ 1164] , For that purpose, Kleene introduced the 
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notion of "realizability," which is, roughly speaking, an analysis of the Brouwer- 
Heyting-Kolmogorov interpretation in computability-theoretic terms. Specifi- 
cally, Kleene defined a relation e realizes ip inductively, where e is a natural 
number and ip is a sentence in the language of first-order arithmetic, with the 
following clauses: 

• If p is atomic, then e realizes tp if and only if it is true. 

• e realizes 9 A -q if and only if e is of the form 2 a 3 b , where a realizes 9 and 
b realizes 77. 

• e realizes 9 V f] if and only if e is of the form 2°3 a and a realizes 9, or e is 
of the form 2 1 3 b and 6 realizes rj. 

• e realizes 6* — ^ r/ if and only if e is the Gddel number of a partial recursive 
function / which, given any a realizing 9, returns a number /(a) realizing 

7]. 

• e realizes 3x 9{x) if and only if e is of the form 2 x i a and a realizes 6(x), 
where x is the numeral that denotes x. 

• e realizes Vx 0(x) if and only if e is the Godel number of a partial recursive 
function / which, given any x, returns a number f(x) realizing 9(x). 

More generally, if <p has free variables, e realizes ip if and only if it realizes its 
universal closure. A formula ip is said to be realizable if there is some number 
realizing it. 

Kleene emphasized that this does not provide a reductive analysis of intu- 
itionistic truth, insofar as quantifiers and logical connectives themselves appear 
in the definition. In particular, if <p is a true, purely universal sentence, then 
anything realizes p; and a realizer for a negated sentence carries no useful in- 
formation at all. However, for any sentence <p in the language of arithmetic, the 
statement "e realizes <p" can also be expressed in the language of arithmetic, 
and one can inquire as to how these two assertions are related to one another. 
Nelson showed that if intuitionistic logic proves p, then it proves that p> is real- 
izable. Unfortunately, it is not the case that for any p, intuitionistic arithmetic 
proves that <p is equivalent to its own realizability; but Nelson showed that in- 
tuitionistic arithmetic does prove that this equivalence is realizable. Moreover, 
one can strengthen the clause for implication: 

• e realizes ^? ^ 77 if and only if 9 implies -q and e is the Godel number 
of a partial recursive function / which, given any a realizing 9, returns a 
number /(a) realizing 77. 

In that case, intuitionistic arithmetic proves that a formula <p is true if and only 
if it is realizable. 

The idea that a constructive proof provides explicit "evidence" for the truth 
of a theorem in question, and that such evidence can often be described in 
computability-theoretic terms, is a powerful one. It can help illuminate the 
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"meaning" or "computational content" of a formal axiomatic system; and, as a 
purely technical device, it can be used to obtain metamathematical properties 
of such systems, such as provability and unprovability results. In 1967, Kleene 
and Vesley |112j presented a formalization of Brouwerian analysis together with 
a suitable realizability interpretation. 

By now, there is a bewildering array of realizability relations in the literature. 
Variations can depend on any of the following features: 

• the language expressing the mathematical assertions (first-order, second- 
order, or higher-order, etc.); 

• the kinds of realizers (arbitrary computable functions, computable func- 
tions in a particular class, or a broader class of functions); 

• the descriptions of the realizers (e.g. whether they are represented by 
natural numbers, or terms in a formal language); 

• whether or not the realizability relation itself is expressed in a formal 
system; or 

• the particular clauses of the realizability relation itself (see, for example, 
the variant of the clause for implication above). 

All these decisions have bearing on the axioms and rules that are realized, 
and the metamathematical consequences once can draw. See Troelstra [206j 
for a definitive reference, as well as [UJ 1205] for more information. Realizabil- 
ity theory can also be used to translate results from constructive analysis into 
computable analysis; see [I2lll45j . 

3.5 The Russian school of constructive mathematics 

The post-Turing era brought a new approach to constructive mathematics, the 
principal tenets of which were set forth by A. A. Markov in the late 1940's and 
developed through the 1950's [1471 1148] , The result is what has come to be 
known as the "Markov school" or "Russian school" of constructive mathemat- 
ics, with contributions by Nikolai Shanin, I. Zaslavskh, Gregory Ceitin, Osvald 
Demuth, and Boris Kushner, among many others@ Aspects of their work have 
already been discussed above in connection with computable analysis. Indeed, 
a hallmark of this style of constructivity is that it relies explicitly on notions 
of computability, which is to say, the real numbers are explicitly defined to be 
computable reals; the notion of a function from the natural numbers to the 
natural numbers is explicitly defined to be a computable function; and so on. 
In contrast to contemporary computable analysis, however, the Russian school 
insisted that proofs also have a constructive character, so that, for example, 
a proof of a statement of the form \lx 3y ip(x, y) statement can be seen to 
yield a computable dependence of y on x. This style of constructivity can be 

6 A detailed bibliography can be found on the Computability and Complexity in Analysis 
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viewed as "constructive computable mathematics," or "constructive recursive 
mathematics," and, indeed, is often referred to as such. 

Note that this style of constructivity stands in stark contrast to Brouw- 
erian intuitionism: even if Brouwer could have identified his "constructions" 
with formal notions of computability, he would have been unlikely to do so. 
The Russian school also adopted a principle that is not found in Brouwerian 
intuitionism, namely, Markov's principle. This states that if P is a decidable 
property of natural numbers (that is, P(x) V ~^P(x) holds for every x), and it is 
contradictory that no x satisfies P, then some x satisfies P. In symbols: 

-iVa; ->P(x) -)• 3xP(x). 

The intuition is that one can find an x satisfying P{x) by simply searching for 
it systematically, since the hypothesis guarantees that the search cannot fail to 
turn up such an x. 

One can find good overviews of this style of constructivity in books by Aberth 
[2] and Kushner [133) . as well as Kushner's survey [134] . 

3.6 Theories of finite types 

In constructive recursive mathematics, one can interpret functions as Turing- 
machine indices, which can, in turn, be thought of as descriptions of a computer 
program. Thus, in constructive recursive mathematics, sets and functions can 
be "coded," or represented, by natural numbers. But ordinary mathematics 
deals not just with sets and functions of natural numbers, but also with sets 
of sets, sets of functions, functionals defined on spaces of functions, and so 
on. Rather than represent all of these using indices for computable objects, it 
is more natural, for some purposes, to keep the computational interpretation 
implicit, and take mathematical objects at face value. 

To that end, it is convenient to adopt the language of finite types. The 
idea traces back to foundational frameworks designed by Frege, Russell and 
Whitehead, and Church, which will be discussed in Section 13.81 Roughly, a 
"type" can be thought of as a syntactic classification of mathematical objects. 
To obtain the finite types, start with the type N, intended to denote the natural 
numbers, so that an object of type N is a natural number. Add the rule that 
whenever A and B are finite types, so is A — > B. Intuitively, an object of type 
A — > B is a function from A to B. Thus, we can form the type N — > N of functions 
from N to N, the type (N — > N) — > N of functionals from N -> N to N, and so on. 
It is sometimes also convenient to add a base type Bool for the Boolean values 
"true" and "false," and product types A x B, but these are inessential. 

Following Godel |77) , we can extend the set of primitive recursive functions to 
the set of primitive recursive functionals of finite type by extending the schema 
of primitive recursion to the higher types. This provides a syntactic calculus for 
defining objects of the various types, and Godel's theory T provides a calculus 
for reasoning about these objects. 

There are then two ways of giving T a computational interpretation. The 
first is to remain at the level of syntax: one provides an explicit procedure to 



29 



"reduce" any term in the calculus to a canonical normal form (see Tait |204) and 
Girard 75 ). Thus if F is a term of type A — ► B, one can view F as denoting 
the function which takes any term t, in normal form, and returns the normal 
form corresponding to F(t). 

A second approach, however, provides a more natural computational un- 
derstanding of the finite types. For each type a, define the set of hereditarily 
recursive functions of type a, HROo-, inductively as follows: the hereditarily 
recursive functions of type N are simply the natural numbers, and the heredi- 
tarily recursive functions of type A to B are those indices e such that for every 
hereditarily recursive function x of type A, ip e (x) is defined, and is a heredi- 
tarily recursive function of type B. Thus the hereditarily recursive functions of 
type N —> N are (indices of) total computable functions; hereditarily recursive 
functions of type (N-)N)-*N are computable functions which, for each total 
computable function, return a number; and so on. It is then easy to interpret 
each term of T as a hereditarily recursive functional. 

In general, hereditarily recursive functions are not extensional: because func- 
tions act on indices, it can happen that two indices e and e' compute the same 
function from N to N, and yet F(e) ^ F(e') for some hereditarily recursive 
functional F. One can repair this by inductively defining extensional equality 
between functionals, and insisting that the interpretations of the function types 
preserve this equality. The resulting set of functionals is then called the heredi- 
tarily effective operations, HEO. This is an instance of what computer scientists 
refer to as a PER model, since for each a, HEO CT is given by a partial equiva- 
lence relation, which is to say, an equivalence relation on a subset of the natural 
numbers. 

One obtains the finite type extensions HA" of Heyting arithmetic by ex- 
tending T with quantifiers and induction over arbitrary finite types. Godel's 
Dialectica interpretation provides an interpretation of HA" in T. Alternatively, 
one can show that provable formulas in HA" are realized by terms in T. Thus, 
whenever HA" proves Vx 3y (p(x,y), where x and y are variables of any finite 
type, there is a hereditarily effective operation which computes y from x. The 
models HRO and HEO can be formalized in intuitionistic arithmetic, HA, show- 
ing that HA" is also interpretable in first-order intuitionistic arithmetic. (For 
discussions of HA", T, and models thereof, see [71 12061 [207] .) 

3.7 Bishop-style constructive mathematics 

In 1967, the American analyst, Errett Bishop, published a book, Foundations 
of Constructive Analysis |16j . which launched a new era in constructivity. In 
the preface, he wrote: 

It appears . . . that there are certain mathematical statements that 
are merely evocative, which make assertions without empirical valid- 
ity. There are also mathematical statements of immediate empirical 
validity, which say that certain performable operations will produce 
certain observable results, for instance, the theorem that every pos- 
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itive integer is the sum of four squares. Mathematics is a mixture of 
the real and the ideal, sometimes one, sometimes the other, often so 
presented that it is hard to tell which is which. The realistic com- 
ponent of mathematics — the desire for pragmatic interpretation — 
supplies the control which determines the course of development 
and keeps mathematics from lapsing into meaningless formalism. 
The idealistic component permits simplifications and opens possi- 
bilities which would otherwise be closed. The methods of proof and 
objects of investigation have been idealized to form a game, but the 
actual conduct of the game is ultimately motivated by pragmatic 
considerations. 

For 50 years now there have been no significant changes in the rules of 
this game. Mathematicians unanimously agree on how mathematics 
should be played. . . 

This book is a piece of constructivist propaganda, designed to show 
that there does exist a satisfactory alternative. 

This was a landmark in the history of constructive mathematics. Brouwerian 
intuitionism relied not only on a vocabulary that is foreign to most working 
mathematicians, but also on principles, such as the continuity of every function 
defined on [0, 1], that are not classically valid. Similarly, the Russian school's 
explicit restriction to computable objects sets it apart from contemporary math- 
ematics. A central feature of Bishop's constructive mathematics is that the 
theorems are classically valid, and, indeed, look like ordinary mathematical the- 
orems. The point, however, is that they are established in such a way that every 
theorem has computational significance. This is achieved by stating definitions 
and theorems carefully, restricting generality ("avoiding pseudogenerality," in 
Bishop's words), and adhering to methods that retain computational meaning. 

Foundations thus began with an informal statement of constructive set- 
theoretic principles. To start with, "a sequence is a rule which associates to 
each positive integer n a mathematical object a„." Then: 

The totality of all mathematical objects constructed in accord with 
certain requirements is called a set. The requirements of the con- 
struction, which vary with the set under consideration, determine 
that set. Thus the integers are a set, the rational numbers are a set, 
and the collection of all sequences each of whose terms is an integer 
is a set. Each set A will be endowed with a relation = of equality. 
This relation is a matter of convention, except that it must be an 
equivalence relation. 

And, as far as functions are concerned: 

The dependence of one quantity on another is expressed in the basic 
notion of an operation. An operation from a set A to a set B is a 
rule / which assigns an element f(a) of B to each element a of A. 
The rule must afford an explicit, finite, mechanical reduction of the 
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procedure for constructing /(a) to the procedure for constructing 
a. . . The most important case occurs when 

f(ai) = /(oa) 

whenever a\ and are equal elements of A. Such an operation / is 
called a function. 

Although Bishop did not provide a formal axiomatic foundation, it is reason- 
able to seek a formal interpretation of this framework. One option is to use a 
system like HA", and interpret each set as a definable subset of a type with a 
definable equivalence relation; that is, take the membership relation x G A to 
be given by a formula <pa{x) and take equality x —a V to be given by another 
formula ip =A (x,y). The interpretations of HA" discussed in the last section 
then give Bishop-style constructive mathematics a direct computational inter- 
pretation. Bishop-style mathematics can also be developed in constructive type 
theory (which will be discussed below) viewing sets as types equipped with an 
equivalence relation (a.k.a. "setoids"). 

Bishop-style constructive mathematics is, in a sense, the most "pure" (or, at 
least, minimal) constructive approach discussed so far, in that it does not rely 
on bar induction or the continuity principle from intuitionistic mathematics, 
nor Markov's principle from constructive recursive mathematics. This makes 
the framework more appealing to classical mathematicians. For example, we 
have seen that the Russian school defines a real number to be a computable real 
number, and a function from reals to reals to be computable in an appropriate 
sense. This means that classical mathematicians cannot view their theorems 
about real numbers as such; they have to be interpreted as saying something 
more restrictive. Similarly, the fact that, in a Brouwerian framework, all func- 
tions on the real numbers are continuous shows that the Brouwerian notion of 
function departs from the classical one. In contrast, even though it is consis- 
tent with Bishop-style mathematics to think of all real numbers and functions 
as being computable, the framework is fully consistent with ordinary classical 
mathematics. In other words, the theorems in the framework can be viewed 
as contemporary mathematical theorems, proved in such a way that the results 
have additional computational significance. 

Bishop's work was continued by a number of people, including Douglas 
Bridges, Ray Mines, Fred Richman, and many others, and remains a mainstay 
of modern constructivity (see, for example, [T71 \31\ ). 

3.8 Intuitionistic higher-order arithmetic 

The finite types, discussed in Section [3~B1 trace back to Gottlob's Frege founda- 
tional system |63] , which was built on a single base type of individuals. They 
made their way, in modified form, into the ramified type theory of Russell and 
Whitehead's Principia Mathematica 185\, and ultimately into Alonzo Church's 
formulation of higher order logic as simple type theory [40] . All of these systems 
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include some form of a scheme of comprehension, 

3XVy(yeX^ <p(y)), 

which asserts that any formula ip with a free variable y of type a gives rise to a 
set, or predicate, X, of type a —> Bool. (More precisely, Frege's "extensions" of 
formulas stood as proxy for such objects.) 

All of the systems just described are based on classical logic, but one can 
just as well consider intuitionistic versions, for example, adding comprehension 
axioms to HA". The result is intuitionistic higher-order logic, or IHOL. From a 
logical perspective, such a system is much stronger than HA". One reason to 
be interested in such a system is that it represents the internal logic of a topos, 
the algebraic structure that Alexander Grothendieck used to study sheaves over 
a space (see, for example, |146| ). 

It is perhaps a matter of debate whether intuitionistic higher-order logic de- 
serves to be called "constructive." But one thing that speaks in favor of this is 
that one can give IHOL a computational interpretation. In fact, this can be done 
in various ways, paralleling the various ways of giving a computational inter- 
pretation to Godel's T. For example, one can define an explicit normalization 
procedure which reduces proofs of IHOL to a canonical normal form; methods 
based on Girard's candidats de reducibilite [7H[7S] show the reduction procedure 
always terminates. Alternatively, one can give a realizability interpretation by 
interpreting IHOL in Martin Hyland's effective topos |101j . 

3.9 Constructive type theory 

At present, the predominant foundational frameworks for constructive mathe- 
matics take the form of constructive type theory. Such frameworks unify two of 
the foundational trends we have seen so far: 

• type theory, in the Frege-Russell-Church-Godel tradition; and 

• the notion of explicit "evidence" for a constructive claim, in the tradition 
of the BHK interpretation and realizability. 

It is the use of dependent types that makes this unification possible. 

In simple type theory, types cannot depend on parameters. For example, 
given a type A and a fixed natural number n, one can form the type A™ of 
n-tuples of elements of A, but one cannot view these as a family of related 
types that depend on the parameter n. In other words, one cannot view A" 
as a type that depends on a variable n of type N. This is exactly the sort of 
thing that dependent type theory is designed to support. To start with, the 
type A — > B of functions which take an argument in A and return a value in B 
can be generalized to a dependent product Y[ x a B(x), where B(x) is a type that 
can depend on x. Intuitively, elements of this type are functions that map an 
element a of A to an element of B(a). When B does not depend on x, the result 
is just A — > B. Similarly, product types A x B can be generalized to dependent 
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sums J2x A Intuitively, elements of this type are pairs (a, b), where a is an 

element of A and b is an element of B(a). When B does not depend on x, this 
is just A x B. 

The second conceptual innovation in constructive type theory is to internal- 
ize the notion of constructive evidence. According to the BHK interpretation, 
knowing a mathematical theorem amounts to having a construction that real- 
izes the claim. Thus we can view any mathematical proposition as specifying 
a special type of data, namely, the type of construction that is appropriate to 
realizing it. This has come to be known as the "propositions as types correspon- 
dence" or the "Curry- Howard correspondence" [351 113 HE] > developed by Curry, 
Howard, Tait, Martin-L6f, and Girard, among others. The point is that logical 
operations look a lot like operations on datatypes. For example, in propositional 
logic, from A and B one can conclude A A B. One can read this as saying that 
given a proof a of A and a proof b of B of B one can "pair" them to obtain 
a proof (a, b) of A A B. In other words, A A B and A x B are governed by the 
same rules. Similarly, the interpretation of implication A — > B mirrors the rules 
for function types: giving a proof of A — > B amounts to constructing a function 
from A to B. In the same way, a proof of Vx : A B(x) is a function which, given 
any a in A, return a proof of B(a). 

Thus, in constructive type theory, some types are naturally viewed as types 
of data and some are naturally viewed as propositions, but the two interact 
and are governed by similar rules. A single calculus gives the rules for defining 
mathematical objects and proving propositions; that is, the calculus provides a 
set of rules for carrying out mathematical constructions of both sorts. Two of 
the most commonly used systems today are Martin-Ldf type theory [151j . and 
the calculus of inductive constructions [43], which extends the original calcu- 
lus of constructions due to Coquand and Huet The relationship between 
the Martin-L6f type theory and the calculus of constructions is similar to the 
relationship between HA" and intuitionistic higher-order logic: the calculus of 
constructions has "imprcdicative" comprehension principles that render it much 
stronger than Martin-Lof's predicative counterpart. 

As was the case with HA" and intuitionistic higher-order logic, one can 
give these systems a computational interpretation in various ways. Indeed, 
semantics for constructive type theory draws on the full range of the theory of 
programming language semantics, making use of realizability interpretations, 
strong normalization proofs, domain theory, and more. The literature on this 
subject is extensive; see, for example, [4] [44] . 

3.10 Computational interpretations of classical theories 

We characterized the Russian school of constructive recursive mathematics as 
reasoning about computable objects in a constructive way. One can maintain 
the first requirement axiomatically while giving up the second: for example, 
the subsystem of second-order arithmetic known as RCAo is a classical system 
for which first-order quantifiers can be interpreted as ranging over the natural 
numbers and second-order quantifiers can be interpreted as ranging over com- 
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putablc sets and functions. Thus RCAo is a reasonable setting for formalizing 
classical computable analysis (see |198| ). 

To what extent can one preserve a computational interpretation of quantifier 
dependences in a formal system that includes the law of the excluded middle? 
There is a trivial sense in which any reasonable classical theory has a computa- 
tional interpretation. Let tp be a II2 statement, that is, an assertion of the form 
\/x By R(x, y) where x and y range over natural numbers and R is a primitive 
recursive relation. Suppose some classical theory T proves tp. Then, assuming 
T can be trusted in this regard, tp is true, which means that a simple-minded 
computer program that, on input x, searches systematically for a y satsifying 
R(x, y) is guaranteed to succeed. 

There is also a fundamental sense in which such an interpretation cannot be 
extended to II3 sentences. Let T(e, x, s) be Kleene's primitive recursive relation 
that expresses that s is a halting computation sequence for Turing machine e 
on input x. Then the classically valid statement that any given Turing machine 
e either halts on input or doesn't can be expressed as follows: 

Ve 3s Vs' (T(e, 0, s) V ->T(e, 0, s')). 

But any function mapping an e to an s satisfying the conclusion provides a solu- 
tion to the halting problem, and thus there is no computable function witnessing 
the Ve 3s dependence. 

Nonetheless, one can often give classical logic an indirect computational in- 
terpretation. One way to do this is to interpret a classical theory in a construc- 
tive one, using the double-negation translation and tricks such as the Friedman- 
Dragalin translation [HI [65] or the Dialectica interpretation j77j [7] to recover 
LI2 theorems (see also JUST]). One can also provide more direct computational 
interpretations, such as realizability relations of various sorts. Griffin [81] has 
shown that classical logic can be understood in terms of a standard semantics for 
programming languages with control operators, such as exceptions. This com- 
putational interpretation is captured by the A/it-calculus designed by Parigot 
|168j (see also [200 ). Chetan Murthy |162| has shown that this interpretation 
can be seen as the result of combining a double-negation translation with the 
Friedman-Dragalin trick, and then using intuitionistic realizability. Jean-Louis 
Krivine [127] has provided a realizability interpretation for full classical set the- 
ory. 

Sometimes what one wants from a classical proof is not a computational 
interpretation per se, but useful computational or quantitative information that 
is hidden by classical methods. In the 1950's, Kreisel spoke of "unwinding" 
classical proofs to obtain such information, a program which has developed 
under the heading of "proof mining," by Kohlenbach and others |119j . 
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